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Abstract 

This paper extends a polynomial-time parsing al- 
gorithm that resolves structural ambiguity in input 
sentences by calculating and comparing the deno- 
tations of rival constituents, given some m odel of 
the application environment (pchuler, 2001). The 



algorithm is extended to incorporate a full set of 
logical operators, including quantifiers and conjunc- 
tions, into this calculation without increasing the 
complexity of the overall algorithm beyond polyno- 
mial time, both in terms of the length of the in- 
put and the number of entities in the environment 
model. 

1 Introduction 

The development of speaker-independent mixed- 
initiative speech interfaces, in which users not only 
answer questions but also ask questions and give in- 
structions, is currently limited by the inadequacy 
of existing corpus-based disambiguation techniques. 
This paper explores the use of semantic and prag- 
matic information, in the form of the entities and 
relations in the interfaced application's run-time en- 
vironment, as an additional source of information to 
guide disambiguation. 

In particular, this paper extends an existing pars- 
ing algorithm that calculates and compares the de- 
notations of rival parse tree constituents in order 
t o resolve stru ctural ambiguity in input sentences 
(Schuler, 2001). The algorithm is extended to incor- 
porate a full set of logical operators into this calcu- 
lation so as to improve the accuracy of the resulting 
denotations - and thereby improve the accuracy of 
parsing - without increasing the complexity of the 
overall algorithm beyond polynomial time (both in 
terms of the length of the input and the number of 
entities in the environment model) . This parsimony 
is achieved by localizing certain kinds of semantic 
relations during parsing, particularly those between 
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quantifiers and their restrictor and body arguments 
(similar to the way dependencies between predicate 
and argument head words are localized in lexicalized 
formalisms such as tree adjoining grammars), in or- 
der to avoid calculating exponential higher-order de- 
notations for expressions like generalized quantifiers. 

2 Basic algorithm 

This se ction describe s the basic environment-based 
parser ( |Schuler, 200l| ) which will be extended in Sec- 
tion [| Because it will crucially rely on the denota- 
tions (or interpretations) of proposed constituents 
in order to guide disambiguatio n, the parser will be 
defined on categorial grammars ( Ajdukicwicz, 1935| ; 



Bar-Hillel, 1953), whose categories all have well de- 
fined types and worst-case denotations. These cat- 
egories are drawn from a minimal set of symbols C 
such that: 

NP 6 C and S E C, 

if 7, 5 € C then j/S £ C and j\6 £ C. 

Intuitively, the category NP describes a noun phrase 
and the category S describes a sentence, and the 
complex categories j/S and j\8 describe 'a 7 lacking 
a 6 to the right' and 'a 7 lacking a S to the left' 
respectively; so for example S\NP would describe a 
declarative verb phrase lacking an NP subject to its 
left in the input. 

The type T and worst-case (most general) denota- 
tion W of each possible category are defined below, 
given a set of entities £ as an environment: 



T(S) = t : truth value 
T(NP) = e : entity 
T( 7 /6) = (T(S),T( 7 )) 
T( 1 \6) = (T(5),T( 1 )) 



W(S) = {TRUE, FALSE} 
W(NP) = £ 

W(j/S) = W(S) x W(7) 
W(j\5) = W{8) x W{i) 



The denotation D of any proposed constituent is 
constrained to be a subset of the worst-case deno- 
tation W of the constituent's category; so a con- 
stituent of category NP would denote a set of en- 
tities, {ei,e2,...}, and a constituent of category 
S\NP would denote a set of entity x truth value 
pairs, { (ei, TRUE), (e 2 , FALSE),...}. Note that no 



denotation of a constituent can contain more than 
0(151") different elements, where v is a valency mea- 



pointers to produce a polynomial-sized shared f orest 



sure of the number of NP symbols occurring within 



the constituent's category. 

This paper will use the following definition of a 
categorial grammar (CG): 

Definition A categorial grammar G is a formal 
grammar (TV, £, P) such that: 

• E is a finite set of words w\ 

• P is a finite set of productions containing: 
7 — » w for all wgE, with 7 G C, 

7 — * 7/(5 (5 for every rule j/S —> . . . in P, 
7 — » <5 7\<5 for every rule 7\<5 in P, 

and nothing else; 

• N is the nonterminal set {7 | 7 — > . . . S P}. 

and the following deductive parser ,[|] which will be 
extended later to handle a richer set of semantic op- 
erations. The parser is defined with: 

• constituent chart items [i,j, 7] drawn from Iq x 
Iq X N, indicating that positions i through j in 
the input can be characterized by category 7; 

• a lexical item 7] for every rule 7 — > w G P if 
to occurs between positions i and j in the input; 

• a set of rules of the form: 
for all 7 - j/S S G P, i, J, fc G IS, 

J 7\*GP, i,i,*€lg. 



representation of al l possible grammatical trees ( Bil- 
lot and Lang, 1989| ) 



fc&SjLg] for all 7 



and can recognize an n-lcngth input as a constituent 
of category 7 (for example, as an S) if it can deduce 
the chart item [0,71,7]. 

This parser can be implemented in a dynamic pro- 
gramming algorithm, using the recursive function: 

k 

F(x)= \/ A 



(where x, a\ . . . are proposed constituents drawn 
from IgxI^xA, V = FALSE, and A = TRUE), 
by recording the result of every recursive sub-call to 
F(x) in a chart, then consulting this chart on sub- 
sequent calls to F(x) for the same x constituent.^ 
Since the indices in every rule's antecedent con- 
stituents a\ . . . each cover smaller spans than 
those in the consequent x, the algorithm will not 
enter into an infinite recursion; and since there are 
only n 2 |7V| different values of x, and only 2n differ- 
ent rules that could prove any consequent x (two rule 
forms for / and \, each with n different values of k), 
the algorithm runs in polynomial time: C(n 3 |iV|). 
The resulting chart can then be annotated with back 



1 Following Shieber et aj 1 1,995 ) 

2 Following Goodman (199£^ 



Traditional corpus-based parsers select preferred 
trees from such forests by calculating Viterbi scores 
for each proposed constituent, according to the re- 
cursive function: 



Sv(x) = max 

Ctl... djt s.t. — 



Y[S v (ai) ■ P(ai...a k \ x) 



These scores can be calculated in polynomial time, 
using the same dynamic programming algorithm as 
that described for parsing. A tree can then be se- 
lected, from the top down, by expanding the highest- 
scoring rule application for each constituent. 

The environment-based parser described here uses 
a similar mechanism to select preferred trees, but the 
scores are based on the presence or absence of enti- 
ties in the denotation (interpretation) of each pro- 
posed constituent Q 



Sd(x) = max y^Spfaj) + < 

ai ...a k s.t.^r^\ i=1 J [ 



1 ifP>(a;)^0 
otherwise 



where the denotation D{x) of a proposed constituent 
x is calculated using another recursive function: 



D{x) = [J 7r|X|^(a s 

a\... a,fc s.t. — — v 




R{x) if k = 
{ () } otherwise 



in which R(x) is a lexical relation defined for each 
axiom x of category 7 equal to some subset of 7's 
worst-case denotation W{^), as defined abovcQ The 
operator XI is natural (relational) join on the fields 
of its operands: 

AMB = {(e 1 ...e max(a>b) ) | (e h ..e a )eA, ( ei ...e b )eB} 

where a,b > 0; and n is a projection that removes 
the first element of the result (corresponding the 
most recently discharged argument of the head or 
functor category): 

nA = {(e 2 ...e a ) I (e h ..e a )eA} 

This interleaving of semantic evaluation and pars- 
ing for the purpose of disambiguation has much in 

3 Here, the score is simply equal to the number of non- 
empty constituents in an analysis, but other metrics are pos- 
sible. 

4 So a lexical relation for the constituent 'lemon' of 
category NP would contain all and only the lemons in 
the environment, and a lexical relation for the con- 
stituent 'falling' of category S\NP would contain a map- 
ping from every entity in the environment to some truth 
value (TRUE if that entity is falling, FALSE otherwise): 
e.g. {(lemoni,TRUE), (lemon 2 , FALSE), ...}. 




Figure 1: Denotation-annotated forest for 'lemon in bin by machine. 



common with that of Dowding et al. (1994), except 



that in this case, constituents are not only scman- 
tically type-checked, but are also fully interpreted 
each time they are proposed. 

Figure ^ shows a sample denotation-annotated 
forest for the phrase 'the lemon in the bin by the 
machine', using the lexicalized grammar: 

lemon, bin, machine : NP 

the : NP/NP 



m, by : 1NP\1NP/1NP 



in which Hie denolalioii of each conslilueiil (the sol 

in each rectangle) is calculated using a join on the 
denotations of each pair of constituents that combine 
to produce it. In this example, the right-branching 
tree would be preferred because the denotation re- 
sulting from the composition at the root of the other 
tree would be empty. 

Since this use of the join operation is linear on the 
sum of the cardinalities of its operands, and since 
the denotations of the categories in a grammar G 
are bounded in cardinality by C(|£^| u ) where v is the 
maximum valency of the categories in G, the total 
complexity of the above algorithm can be shown to 
be 0(n 3 \£\ v ): polynomial not only on the length of 
t he input n, bu t also on the size of the environment £ 
( |Schulcr, 200l| ). 



3 Extended algorithm 

The above algorithm works well for attaching ordi- 
nary complements and modifiers, but as a semantic 
theory it is not sufficiently expressive to produce cor- 
rect denotations in all cases. For example, the lexical 
relations defined above are insufficient to represent 
quantifiers like 'no' (using category NP/NP) in the 



phrase 'the boy with no backpack. '[] A similar prob- 
lem occurs with conjunctions; for example, the word 
'and' (using category NP\NP/NP) in the phrase 'the 
child wearing glasses and blue pants', also cannot 
be properly represented as a lexical relation.^] This 
raises the question: how much expressivity can be 
allowed in a shared semantic interpretation without 
exceeding the tractable parsing complexity neces- 
sary for practical environment-based par sing 7 



In traditional categorial semantics (Montague 



1973; Barwise and Cooper, 1981; Keenan and Stavi 



1986) quantifiers and noun phrase conjunctions de- 



note higher-order relations: that is, relations be- 
tween whole sets of entities instead of just be- 
tween individuals. Under this interpretation, a 
quantifier like 'no' would denote a set of pairs 
{(Ai, Si), (A 2 , B 2 ), ■ ■ ■} where each Ai and Bi are 
disjoint subsets of £ , corresponding to an accept- 
able pair of restrictor and body sets satisfying the 
quantifier 'no'. Unfortunately, since the cardinalities 
of these higher-order denotations can be exponential 
on the size of the environment £ (there are 2' f ' pos- 
sible subsets of £ and 2 2 ' £ possible combinations of 
two such subsets), such an approach would destroy 
the polynomial complexity of the environment-based 
parsing algorithm. 



5 Assigning the identity relation {(ei,ei), (e2,e2), ■ • • } to 
the quantifier would incorrectly yield the set of boys with a 
backpack as a denotation for the full noun phrase; and assign- 
ing the converse relation (from each entity in the environment 
to every other entity {{ei , ea), (ei, ea), . . . }) would incorrectly 
yield the set of boys with anything that is not a backpack. 

6 The identity relation {{ei, e%, e\), (e2, e2, e2), • • • }, which 
yields a correct interpretation in verb phrase conjunction, 
would yield an incorrect denotation for the noun phrase 
'glasses and blue pants,' containing only entities which are 
at once both glasses and pants. 



However, if the number of possible higher-order 
functions is restricted to a finite set (say, to some 
subset of words in a lexicon), it becomes tractable 
to store them by name rather than by denotation 
(i.e. as sets). Such function can then discharge all 
their first-order arguments in a single derivational 
step to produce a first-order result, in order to avoid 
generating or evaluating any higher-order partial re- 
sults. Syntactically, this would be analogous to com- 
posing a quantifier with both a noun phrase restric- 
tor and a body predicate (e.g. a verb or verb phrase) 
at the same time, to produce another first-order 
predicate (e.g. a verb phrase or sentence). Since 
a generalized quantifier function merely counts and 
compares the cardinalities of its arguments in a lin- 
ear time operation, this analysis provides a tractable 
shortcut to the exponential calculations required in 
the conventional analysis. 

Note that this analysis by itself does not admit 
productive modification of quantifiers (because their 
functions are drawn from some finite set) or of quan- 
tified noun phrases (because they are no longer de- 
rived as a partial result). This causes no disruption 
to the attachment of non-conjunctive modifiers, be- 
cause ordinary syntactic modifiers of quantifier con- 
stituents are seldom productive (in the sense that 
their composition does not yield functions outside 
some finite set), and syntactic modifiers of NP con- 
stituents usually only modify the restrictor set of the 
quantifier rather than the entire quantified function, 
and can therefore safely be taken to attach below 
the quantifier, to the unquantified NP. 

But this is not true in cases involving conjunc- 
tion. Conjoined quantifiers, like 'some but not all,' 
cannot always be defined using a single standard lex- 
ical function; and conjunctions of quantified noun 
phrases, like 'one orange and one lemon', cannot 
be applied to unquantified subconstituents (syntac- 
tically, because this would fail to subsume the sec- 
ond quantifier, and semantically, because it is not 
the restric tor se ts which are conjoined). Keenan 
and Stavi (1986) model conjunctions of quantifiers 
and quantified noun phrases using lattice operations 
on higher-order sets, but as previously stated, these 
higher-order sets preclude tractable interleaving of 
semantic interpretation with parsing. 

The solution proposed here is to treat each quan- 
tifier or quantified noun phrase conjunction as an el- 
liptical conjunction of two complete first-order pred- 
icates (e.g. verb phrases or sentences), each subsum- 
ing a different quantifier and noun phrase restrictor 
(in the case of NP conjunction), but sharing or du- 
plicating a common body predicate. This analysis 
requires multiple components to keep track of the 
duplicated material above the conjunction, but as 
long as the number of components is bounded, the 
polynomial complexity of the parsing algorithm is 




(duplicated) (unduplicated) (unduplicated) 
Figure 2: Duplicated verb in NP conjunction. 
retained.Fl 

Figure^ shows a duplicated verb predicate in the 
derivation of an NP conjunction. The conjoined 
constituents (the shaded regions in the figure) are 
each composed of two components: one for the NP 
itself, containing the quantifier and the restrictor 
predicate, and one for the verb which supplies the 
body predicate of the quantifier. Since the conjoined 
constituents both correspond to complete quanti- 
fier expressions with no unsatisfied first-order argu- 
ments, their categories are that of simple first-order 
predicates (they are each complete verb phrases in 
essence: 'containing one orange' and 'containing one 
lemon'). The conjunction then forms a larger con- 
stituent of the same form (the unshaded outline 
in the figure), with a lower component containing 
the conjoined constituents' NP components concate- 
nated in the usual way, and an upper component in 
which the conjoined constituents' non-NP compo- 
nents are identified or overlapped. If the duplicated 
components do not cover the same string yield, the 
conjunction does not apply. 

Note that, since they are only applied to ordinary 
first-order predicates (e.g. sentences or verb phrases) 
in this analysis, conjunctions can now safely be as- 
signed the familiar truth-functional denotations in 
every casej^j Also, since the resulting constituent 
has the same number of components as the conjoined 
constituents, there is nothing to prevent its use as 
an argument in subsequent conjunction operations. 

A sample multi-component analysis for quantifiers 
is shown below, allowing material to be duplicated 
both to the left and to the right of a conjoined NP: 

some,all,no,etc. : A\NP g • NP g \NP g • NP 9 /NP e 
X/NP q • NP g \NP, ■ NP 9 /NP e 

The lexical entry for a quantifier can be split in this 



7 Dahl and McCord (L983) propose a similar duplication 
mechanism to produce appropriate semantic representations 
for NP and other conjunctions, but for different reasons. 

8 e.g. for the word 'and': {{...TRUE, ...TRUE, ...TRUE), 
{..TRUE, ..FALSE, ..FALSE), {..FALSE, ..TRUE, ..FALSE), 
{..FALSE, ..FALSE, ..FALSE)} 



way into a number of components, the last (or low- 
est) of which is not duplicated in conjunction while 
others may or may not be. These include a com- 
ponent for the quantifier NP g /NP e (which will ulti- 
mately also contain a noun phrase restrictor of cate- 
gory NP 6 ), a component for restrictor PPs and rela- 
tive clauses of category NP g \NP g that are attached 
above the quantifier and duplicated in the conjunc- 
tion, and a component for the body (a verb or verb 
phrase or other predicate) of category A\NP 9 or 
X/NP q . The subscript q specifies one of a finite 
set of quantifiers, and the subscript e indicates an 
unquantified NP. 

The deductive parser presented in Section ^ can 
now be extended by incorporating sequences of rec- 
ognized and unrecognized components into the con- 
stituent chart items. As constituents are com- 
posed, components are shifted from the unrecog- 
nized sequence 71 • • • 7 C to the recognized sequence 
Jij 7i) * " " {icijci 7c)j until the unrecognized se- 
quence is empty. 

The extended parser is defined with: 

• chart items of the form [i,j, A, S], where A is 
a sequence of unrecognized components 7, X is 
a sequence of recognized components (0,6,7), 
and i,j, k, a, b, c are indices in the input. Each 
item A-j, (h,ji,ji) ■ ■ ■ (i c , jc,7c)] indicates 
that the span from i to j in the input can be 
characterized by the categories 71 through 7 C at 
positions i\ to ji through i c to j c respectively, so 
that if these spans are concatenated in whatever 
order they occur in the input string, they form 
a grammatical constituent of category 7 with 
unrecognized components A. 

• a lexical item [i, j, 7, (i, j, 7)] for every rule 7 — > 
w e P if w occurs between positions i and j in 
the input; 

• a set of rules for all i, j, k, a,b,c € Iq as below. 
Two rules to invoke left and right function ap- 
plication to an existing component: 



[i,k,j/8,(i,k,j/8)] [k,j,A-S,(k,b,S/e)-Ti] 
[i,j,A-7,(i,6,7/e)-S] 7 

[k,j,-f\S,(k,j,-f\S)] [i,fc,A-a,(a,M\e)-£] 
[i,j,A'7,(o,j,7\e)'S] 



7/5 SeP, 
7^5 7\<5eP, 



Two rules to invoke left and right function ap- 
plication to a fresh component: 



[i,k,f/S,(i,k,f/S)] [fcj,A-7/g-g.S] 
[ij,A-7,<i,fc,7/<5)-£] 

[k,j,-r\S,{k,j,-r\S)] [i,fc,A- 7 \g-g,S] 
[i,j,A-7,(fcj,7\5)-S] 



7^7/5 Sep, 

7^<5 7\<5eP, 



Two rules to discharge empty components: 

[i,j,A-l/SS,Z] [i,j,A-t\S-S,T,] 
[i,j,A-7,B] [i,j,A-7,E] 

Three rules to skip conjunctions, by adding a 
gap between the components in a constituent 



(the first rule consumes the conjunction to cre- 
ate a partial result of category Conj^, and the 
latter two use this to skip the opposing NP): 

[k j A -8 S] 
[i,j,A-Conji,S] [' AConj .(i.fc.Conj)] 

RIA^S] ['' fc ' A - g '-] [i,j,A-3,S] [^.A-Conj^,-] 

Two rules to reassemble discontinuous con- 
stituents (again, using a partial result Conj^ to 
reduce the number of ranging variables): 

[a,c,Conj. (a.c.Conj)] [z,j.7,S- {c.b.S)] 
[i,j,7,'s-(a,6,ConjJ>j 

[i,j,7,S-<e,&,Conj^)] [i,j, 7 ,S- (a,c,(5)] 

Two rules to combine adjacent components: 

[i,j,7,S>,c,<Ve)-(c,&, £ )] [i,j,7,S-(c,fc,a\e)-(q,c,£)] 
[t,j,7,S.(o,6,<5)] [t,j,7,E.(o,b,<5)] 

And one rule to apply quantifier functions: 

lij,7,E-<a,6,5 e )J 

The parsing and scoring functions remain identi- 
cal to those in Section |[ but an additional k = 1 
case containing a modified projection function 7r is 
now added to the interpretation function, in order 
to make the denotations of quantified constituents 
depend on their associated quantifiers: 



(R(x) 



D(X): 

a±... a,k s.t 



u 



if k = 



n q D(ai) if k = 1 and 

oi _ j...(...S q )] 
x ~ [...(...5 e )j 

^D(ai) otherwise 

V<=1 



The modified projection function evaluates a quan- 
tifier function q on some argument denotation A, 
comparing the cardinality of the image of the re- 
strictor set in A with the the cardinality of image of 
the intersected restrictor and body sets in A:P| 

TTq A = {(e 2 ...e a ,t) I (.,e 2 ...e a ,-)eA, t = q{\R\, \S\) 
R = AM{{-,e 2 ...e a ,-)}, 
S = iM{(.,e2...e„,TRUE)}} 

This algorithm parses a categorial grammar in the 
usual way - constituents are initially added to the 
chart as single components covering a certain yield 
in the input string (the indices of the component 
are the same as the indices of the constituent itself) , 
and they are combined by concatenating the yields 
of smaller constituents to make larger ones - until a 
conjunction is encountered. When a conjunction is 
encountered immediately to the left or right of a rec- 
ognized constituent constituent x, and another con- 
stituent of the same category is found immediately 



^Following Keenan and Stavi (1986) 



containing 1 one orange 2 an d 3 one lemon 4 

[0,1,S\NP,/NP,,, [1,2,X\NP 3 -NP 3 \NP 3 -NP 3 , [2,3,Conj, [3, 4, X\NP 3 • NP 3 \NP 3 ■ NP 3 , 

(0,1,S\NP 9 /NP,-)] (1,2,NP 3 >] {01,02,03,04} <2,3,Conj>] <3,4,NP 3 )] {h,l 2 ,h} 

{{01, xx), {h,xi), (h,x 3 )} 

(1) 

[1, 4, X\NP 3 -NP 3 \NP 3 -NP 3 , (1,2, NP 3 )] {o 1; o 2 , 03, o 4 } 

(2) 

[1,4,X\NP 3 -NP 3 \NP 3 •NP 3 ,(3,4,jVP 3 >] {h,h,h} 

(3) 

[1,4,X\NP 3 -NP 3 ,(l,2,iVP 3 >] {01,02,03,04} 

(4) 

[1,4,X\NP 3 -NP 3 ,(3,4,iVP 3 >] {h,h,h} 

(5) 

[0,4,S\NP q ,(0,l,S\NP 9 /NP 3 ) -<1,2,NP 3 )] {<oi,n>} 

(6) 

[0,4,S\NP„(0,1,S\NP,/NP 3 > • <3,4,NP 3 )] {<i a ,xi>, (h,x 3 )} 

(7) 

[0, 4, S\NP q , (0, 1, S\NP,/NP e ) • (1, 2, NP e >] {zi} 

(8) 

[0,4,S\NP„(0,l,S\NP,/NP e > • <3,4,NP e >] {asj.zs} 

(9) 

[0, 4, S\NP q , (0, 1, S\NP 9 /NP £ ) • (1, 4, NP e >] {xi} 

(10) 

[0,4,S\NP„(0,4,S\NP q >] {xi} 



Figure 3: Sample derivation of conjoined NP. 



beyond that conjunction, the parser creates a new 
constituent that has the combined yield of both con- 
stituents, but copies at's component yield (the string 
indices of x's original components) with no change. 
This has the effect of creating two new constituents 
every time two existing constituents are conjoined: 
each with a different component yield, but both with 
the same (combined) constituent yield. These new 
discontinuous constituents (with component yields 
that do not exhaust their constituent yields) are still 
treated as ordinary constituents by the parser, which 
combines them with arguments and modifiers until 
all of their argument positions have been success- 
fully discharged, at which point pairs of discontinu- 
ous constituents with the same constituent yield can 
be reassembled into whole - or at least less discon- 
tinuous - constituents again. 

A sample derivation for the verb phrase 'con- 
taining one orange and one lemon,' involving con- 
junction of existcntially quantified noun phrases, is 
shown in Figure using the above parse rules and 
the lexicalized grammar: 

containing : S\NP g /NP,/ 

one : X\NP q ■ NP ? \NP 9 • NP,/NP £ 
X/NP q • NP 9 \NP 9 • NP g /NP e 
orange, lemon : NP C 
and : Conj 

First the parser applies the skip conjunction rules 
to obtain the discontinuous constituents shown af- 
ter steps (1) and (2), and a component is discharged 



from each of the resulting constituents using the 
empty component rule in steps (3) and (4). The 
constituents resulting from (3) and (4) are then com- 
posed with the verb constituent for 'containing' in 
steps (5) and (6), using the left attachment rule for 
fresh components. The quantifiers are then applied 
in steps (7) and (8), and the resulting constituents 
are reassembled using the conjunction rules in step 
(9). The adjacent components in the constituent 
resulting from step (9) are then merged using the 
combination rule in step (10), producing a complete 
gapless constituent for the entire input. 

Since the parser rules are fixed, and the number of 
components in any chart constituent is bounded by 
the maximum number of components in a category 
(inasmuch as the rules can only add a component 
to the recognized list by subtracting one from the 
unrecognized list), the algorithm must run in poly- 
nomial space and time on the length of the input 
sentence. Since the cardinality of each constituent's 
denotation is bounded by \£\ v (where £ is the set 
of entities in the environment and v is the maxi- 
mum valency of any category) , the algorithm runs in 
worst-case polynomial space on \£\; and since there 
is no more than one set composition operation per- 
formed when a rule is applied, and each composition 
operation runs in worst-case quadratic time on the 
size of its composed sets (due to the quantifier oper- 
ation), the algorithm runs in worst-case polynomial 
time on \£\ as well. 



4 Evaluation 



5 Conclusion 



The extended parser described above has been im- 
plemented and evaluated on a corpus of 340 spo- 
ken instructions to simulated human-like agents in 
a controlled 3-D environment (that of children run- 
ning a lemonade stand, which was deemed suitably 
familiar to undergraduate student subjects). The 
parser was run on the word lattice output of an 
off-the-shelf speech recognizer (CMU Sphinx II) and 
the parser chart was seeded with every hypothesized 
word. The parser was also compared with the rec- 
ognizer by itself, in order to determine the degree to 
which an environment-based approach could com- 
plement corpus-based disambiguation. The systems 
were evaluated as word recognizers (i.e. ignoring the 
brackets in the parser output) on the first 100 sen- 
tences of the corpus (corresponding to the first seven 
of 33 subjects); the latter 240 sentences were re- 
served for training the recognizer and for developing 
the grammar and semantic lexicon. 

The average utterance length was approximately 
three seconds (subsuming about 300 frames or posi- 
tions in the parser chart), containing an average of 
nine words. Parsing time averaged under 40 seconds 
per sentence on a P4-1500MHz, most of which was 
spent in forest construction rather than denotation 
calculation. 

Accuracy results show that the parser was able to 
correctly identify a significant number of words that 
the recognizer missed (and vice versa), such that a 
perfect synthesis of the two (choosing the correct 
word if it is recognized by either system) would pro- 
duce an average of 8 percentage points more recall 
than the recognizer by itself on successful parses, 
and as much as 19 percentage points more for some 
subject s:0 
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which indicates that the environment may offer a 
useful additional source of information for disam- 
biguation. Though it may not be possible to imple- 
ment a perfect synthesis of the environment-based 
and corpus-based approaches, if even half of the 
above gains can be realized, it would mark a sig- 
nificant advance. 



This paper has described an extension to an 
environment-based parsing algorithm, increasing its 
semantic coverage to include quantifier and conjunc- 
tion operations without destroying its polynomial 
worst-case complexity. Experiments using an imple- 
mentation of this algorithm on a corpus of spoken 
instructions indicate that 1) the observed complex- 
ity of the algorithm is suitable for practical user in- 
terface applications, and 2) the ability to draw on 
this kind of environment information in an inter- 
faced application has the potential to greatly im- 
prove recognition accuracy in speaker-independent 
mixed-initiative interfaces. 
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